Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python 2/e

Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python 2/e

作者: Bruce Peter Andrew Gedeck
出版社: O'Reilly
出版在: 2020-06-02
ISBN-13: 9781492072942
ISBN-10: 149207294X
裝訂格式: Quality Paper - also called trade paper
總頁數: 368 頁





內容描述


Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this practical guide--now including examples in Python as well as R--explains how to apply various statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.
Many data scientists use statistical methods but lack a deeper statistical perspective. If you're familiar with the R or Python programming languages, and have had some exposure to statistics but want to learn more, this quick reference bridges the gap in an accessible, readable format.
With this updated edition, you'll dive into:

Exploratory data analysis
Data and sampling distributions
Statistical experiments and significance testing
Regression and prediction
Classification
Statistical machine learning
Unsupervised learning


作者介紹


Peter Bruce is the Founder and Chief Academic Officer of the Institute for Statistics Education at Statistics.com, which offers about 80 courses in statistics and analytics, roughly half of which are aimed at data scientists. He has authored or co-authored several books in statistics and analytics, and he earned his Bachelor's degree at Princeton, and Masters degrees at Harvard and the University of Maryland.
Andrew Bruce, Principal Research Scientist at Amazon, has over 30 years of experience in statistics and data science in academia, government and business. The co-author of Applied Wavelet Analysis with S-PLUS, he earned his bachelor's degree at Princeton, and PhD in statistics at the University of Washington
Peter Gedeck, Senior Data Scientist at Collaborative Drug Discovery, specializes in the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. Co-author of Data Mining for Business Analytics, he earned PhD's in Chemistry from the University of Erlangen-Nürnberg in Germany and Mathematics from Fernuniversität Hagen, Germany




相關書籍

Devops in Python: Infrastructure as Python

作者 Zadka Moshe

2020-06-02

Hadoop大數據分析實戰

作者 [美] 斯里達爾?奧拉 李垚 譯

2020-06-02

無人機設計與開發實戰——基於Paparazzi的小型四旋翼(微課視頻版)

作者 蘇立軍 齊曉慧 主編 董海瑞 席雷平 朱紅娟 副主編

2020-06-02